IDO: Intelligent Data Outsourcing with Improved RAID Reconstruction Performance in Large-Scale Data Centers

نویسندگان

  • Suzhen Wu
  • Hong Jiang
  • Bo Mao
چکیده

Dealing with disk failures has become an increasingly common task for system administrators in the face of high disk failure rates in large-scale data centers consisting of hundreds of thousands of disks. Thus, achieving fast recovery from disk failures in general and high online RAID-reconstruction performance in particular has become crucial. To address the problem, this paper proposes IDO (Intelligent Data Outsourcing), a proactive and zone-based optimization, to significantly improve on-line RAID-reconstruction performance. IDO moves popular data zones that are proactively identified in the normal state to a surrogate set at the onset of reconstruction. Thus, IDO enables most, if not all, user I/O requests to be serviced by the surrogate set instead of the degraded set during reconstruction. Extensive trace-driven experiments on our lightweight prototype implementation of IDO demonstrate that, compared with the existing state-of-the-art reconstruction approachesWorkOut and VDF, IDO simultaneously speeds up the reconstruction time and the average user response time. Moreover, IDO can be extended to improving the performance of other background RAID support tasks, such as re-synchronization, RAID reshape and disk scrubbing.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

WorkOut: I/O Workload Outsourcing for Boosting RAID Reconstruction Performance

User I/O intensity can significantly impact the performance of on-line RAID reconstruction due to contention for the shared disk bandwidth. Based on this observation, this paper proposes a novel scheme, called WorkOut (I/O Workload Outsourcing), to significantly boost RAID reconstruction performance. WorkOut effectively outsources all write requests and popular read requests originally targeted...

متن کامل

Failure Recovery Issues in Large Scale, Heavily Utilized Disk Storage Systems

Large data is increasingly important to large-scale computation and data analysis. Storage systems with petabytes of disk capacity are not uncommon in high-performance computing and internet services today and are expected to grow at 40-100% per year. These sizes and rates of growth render traditional, single-failure-tolerant (RAID 5) hardware controllers increasingly inappropriate. Stronger pr...

متن کامل

Identification of Pattern used in Determination of Critical Success Factors in ITS Projects, Case Study: Road Maintenance and Transportation Organization

One of the risks recognized by relevant authorities is the risk of outsourcing ITS projects. The purpose of this study was to design and explain the pattern of determining the critical success factors in outsourcing large-scale ITS projects in the Ministry of Roads and Urban Development (Road Maintenance and Transportation Organization). This study was performed using qualitative method. The pa...

متن کامل

Exploiting redundancy to construct energy-efficient, high-performance RAIDs

Recent studies show that disk-based I/O subsystems account for a non-trivial portion of energy consumption in data-intensive environment such as storage servers and data centers. Previous powerefficient I/O solutions for a single disk drive or mobile computers cannot be applied to data-intensive environment where the I/O load is much more intensive. Current solutions seek help from multi-speed ...

متن کامل

PRO: A Popularity-based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems

Hong Jiang began his talk by discussing the importance of data recovery. Disk failures have become more common in RAID-structured storage systems. The improvement in disk capacity has far outpaced improvements in disk bandwidth, lengthening the overall RAID recovery time. Also, disk drive reliability has improved slowly, resulting in a very high overall failure rate in a large-scale RAID storag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012